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Abstract. For a rational matrix function <& with poles outside the unit circle, 
we estimate the degree of the unique superoptimal approximation _4$ by matrix 
functions analytic in the unit disk. We obtain sharp estimates in the case of 
2x2 matrix functions. It turns out that "generically" degA& < dcg$ — 2. 
We prove that for an arbitrary 2x2 rational function <3>, deg„4$ < 2deg<l> — 3 
whenever deg$ > 2. On the other hand, for k > 2, we construct a 2 x 2 matrix 
function <&, for which deg$ = k, while deg^4$ = 2k — 3. Moreover, we conduct 
a detailed analysis of the situation when the inequality deg A§ < deg $ — 2 can 
violate and obtain best possible results. 

1. Introduction 

The problem of uniform approximation of a given function on the unit circle 
T by functions analytic in the unit disk is a classical problem of approximation 
theory that is very important in various applications (see [Pe]). It is well known 
(follows from a compactness argument) that for a function <p> in L°°(T) there is 
always a best approximant / G H°°: 

\\v - /||oo = disti/x>((£>, H°°). 

Though in general a best approximant does not have to be unique, under certain 
natural assumptions on <p we do have uniqueness. In particular, uniqueness holds 
for continuous functions (p; this was established for the first time in [Kh] and then 
rediscovered by several other mathematicians. 

A function ip 6 L°° on T is called badly approximable if the zero function is a 
best approximant of <p. It is well known (see e.g., [Pe], Ch. 7, §5) that a nonzero 
continuous function is badly approximable if and only if it has constant modulus 
and negative winding number with respect to the origin. This criterion can be 

extended to functions in QC = H°° + Cf)H°° + C, see [Pe], Ch. 7, §5. If <p 
is a nonzero rational function, then ip is badly approximable if and only if it has 
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constant modulus and 



degP+y? < degP_y? (1.1) 

(this follows from the above criterion and the argument principle). 

It is even more important in applications to consider the problem of analytic 
approximation of matrix functions on the unit circle; in systems theory scalar 
functions correspond to single input - single output systems, while matrix-valued 
functions correspond to multiple input - multiple output systems (see e.g., [Fu], 
[Fr], [Pe]). 

It turns out, however, that the situation with best analytic approximation of ma- 
trix functions is quite different from the scalar case. Suppose that $ £ L°°(M m n ), 
i.e., $ is a bounded measurable function with values in the space M mi „ of m x n 
matrices. A matrix function F in H°°(M. mtn ) is said to be a best approximant of 
$ if 

||$ - F\\ Loo = inf{||$ - Q\\ Loo : Q £ H°°(M m , n )}. 
Here for a function \& in L°°(M mjn ) we use the notation 

|| £oo d = esssup ||*(C)||M m ,„- 

However, in the matrix case even for polynomial matrix functions a best approx- 
imant is almost never unique. Consider the matrix function $ = ( f J . It is 



x , 

easy to see that ||$||l°° = 1- On the other hand, if / is a scalar function in the 



unit 



it ball of H°° then the function ^ jj ° ^ £ H QO (M 2t2 ) is a best approximant 
of <I>. 

In the matrix case it is more natural to impose additional requirements on a 
best approximant to be able to choose among best approximants the "very best" 
one. Recall that for a matrix (or an operator on Hilbert space) A the singular 
value Sj(A), j > 0, is, by definition, the distance from A to the set of matrices 
(operators) of rank at most j. Clearly, s (A) = 



Definition. Given a matrix function $ £ L°°(M min ), we define inductively the 
sets Qj, < j < min{m, n} — 1, by 

n = {F £ #°°(M miri ) : F minimizes t = esssup ||$(C) - F(C)||}| 

CeT 

flj = {F £ Qj-i : F minimizes tj d = esssup Sj($(() - F (())}, j > 0. 

CeT 
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Functions in f] fl k = f2 m in{m,n}-i are called superoptimal approximants of <3> by 

fe>0 

bounded analytic matrix functions. The numbers tj = tj(Q) are called the superop- 
timal singular values of $. Note that the set fi consists of the best approximants 
of $. 

We say that $ is badly approximable if 

||$|| L =c =dist L oc ($, H°°(M mjn )) 

and we say that $ is very 6ac% approximable if the zero function is a superoptimal 
approximant of $. 

It was proved in [PY1] (see also [Pe], Ch. 14, §3) that for a continuous m x n 
matrix function $, there exists a unique superoptimal approximant F and 

Sj($(C) - F(()) = tj($), < j < min{m,n} - 1, for almost all (el 

For a continuous matrix function $, we denote by A& the unique superoptimal 
approximant of $. 

Note that rational matrix functions play a special role in applications in con- 
trol theory; they correspond to transfer functions whose minimal realizations have 
finite-dimensional state spaces. Moreover, the dimension of the state space of 
minimal realization is equal to the (McMillan) degree deg $ of a matrix rational 
function <3>, see e.g., [Fu], [Pe] (see the definition of deg<3> in §2). That is why it is 
important to obtain sharp estimates of the degree of a rational matrix function. 

It is well known (see [Pe], Ch. 7, §1) that for a scalar rational function ip the 
following inequality holds: 

deg A<p < deg (p and deg A<p < deg (p — 1 unless G H°° 

(if ip G H°°, then clearly, Atp = ip). 

It was shown in [PY1] (see also [Pe], Ch. 14, § 12) that if $ is a rational matrix 
function with poles outside T, then A$ is also rational. It is a very important 
problem to obtain sharp estimate of the degree of ^4$ in terms of the degree of $. 
In particular, it has been an open problem for over 10 years whether 

deg^$ < deg$. (1.2) 

In this paper we obtain definitive results for 2 x 2 matrix functions. We show in 
§3 that (1.2) holds "generically" . Moreover, generically in the nondegenerate case 
rank($ — A<&) = 2 the following stronger inequality holds: 

deg„4$ < deg$ - 2. 

However, in [PY2] an example of a 2 x 2 rational matrix function $ is given, for 
which rank($ - A<&) = 2, but deg^ < deg $ - 1. 

In this paper we show that actually, the degree of A& can be greater than the 
degree of In § 4 for k > 2, we give an example of a 2 x 2 matrix function $ such 
that deg $ = k, while deg^l$ = 2k - 3. 
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This is the worst possible case. We show in § 3 that if k > 2 and deg $ = k, 
then deg „4$ < 2k — 3. Note that the last inequality was obtained by state space 
methods in [LHG], but under an additional (rather complicated) assumption on $. 
Note also that the authors of [LHG] also obtained (under an additional assumption) 
estimates in the case of rational matrix functions of an arbitrary size. 

Let us describe the results of § 3 in more detail. Let 

(P+$)(2) = and P - $ = $ ~ P+$ ' 

where $(j) is the jth Fourier coefficient of the restriction of $ to T. Consider the 
matrix function \1/ defined by \& = $ — A&. Then \1> is a very badly approximable 
matrix function. Clearly, 

AF_^ = „4P_$ = A§- P + $ = -P+^ 

and it is easy to see that 

deg A<& - deg $ < deg P + ^ - deg P_^ 

(the degree of a rational matrix function is defined in §2). Thus the problem of 
estimating the degree of A<& in terms of the degree of $ reduces to the problem 
of estimating the degree of P + \l/ in terms of the degree of P_\P for very badly 
approximable rational matrix functions 

It follows from the results of [PY1] (see also [Pe], Ch. 14, §5 and § 12) that \l/ is 
a 2 x 2 very badly approximable rational matrix function if and only if the function 
\1> admits on the unit circle T a special (thematic) factorization of the form 

* = (*1 -"2 W*0«0 \f V, V 2 \ 
\w 2 W 1 J \ txUi J \ -V 2 Vi J ' 

where v\, v 2 , w\, and w 2 are scalar rational functions such that |fi| 2 + |t>2| 2 = 
\vjxl 2 + |w 2 | 2 on T, Vi and v 2 have no common zeros in D, and W\ and w 2 have 
no common zeros in D; t > ti > 0; and u and U\ are scalar badly approximable 
rational functions such that |u | = \u\\ = 1 on T. 

We prove in §3 that if W is nondegenerate (i.e., ti > 0) and the inequality 

degP+^ < degP_^ - 2 (1.4) 

is violated, then there is a point A in the unit disk D such that u\ has a pole at A, 

Un has a zero at A, and both v = ( Vl | and w = ( Wl | have poles at 1/A. We 

V v 2 ) V W2 ) 

also show in §3 that for fixed to, uo, u±, vi, v 2 , wi, and w 2 , inequality (1.4) can be 
violated for no more than degP_«i numbers t\ £ (0,t ]. Moreover, we show in §3 
that for \1> given by (1.3), the sum of [degP+\I/ + 2 — degP_\l/] + over all numbers 
ti £ (0, t ] cannot be greater than degP_^i. Here for x E R, we use the notation 

[x]+ = max{x, 0}. 
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In §4 we construct an example of a matrix function of the form (1.3) such 
that inequality (1.4) is violated for precisely degP_«i values of ti G (0, t ]. This 
example also shows that that the above estimate obtained in § 3 is best possible. 

In § 2 we collect information on Hankel and Toeplitz operators, degrees of ratio- 
nal matrix functions, and very badly approximable matrix functions. 



2. Preliminaries 

2.1. Hankel and Toeplitz operators on spaces of vector functions. For 

a matrix function $ in L°°(M min ), the Hankel operator 

H$ : H 2 (C n ) -> #!(C m ) = L 2 (C m ) H\C m ), 

is defined by 

if*/ = P_($/), 

where P_ is the orthogonal projection from L 2 (C m ) onto H 2 i{ ( C m ). Here H 2 (C n ) is 
the Hardy space of C"-valued functions that is identified naturally with a subspace 
ofL 2 (C"). 

It is well known that 

11^*11 = distioo ($,/r°(M m , n )). 

The operator H$ is compact if and only if $ G {H°° + C)(M mi „). 
For $ G L°°(M min ), the Toeplitz operator 

T$ : H 2 (C n ) -> H 2 (C m ), 

is defined by 

T*/ = P + ($/), 

where P + is the orthogonal projection from L 2 (C m ) onto H 2 (C m ). 

If $ is a continuous square matrix function, then T$ is Fredholm (an operator 
T on Hilbert space is called Fredholm if its range is closed and both KerT and 
Ker T* are finite-dimensional) if and only if det $ does not vanish on T. 

We refer the reader to [Pe] for more detailed information on Hankel and Toeplitz 
operators on spaces of vector functions. 

2.2. The degree a of rational matrix function. To define the (McMillan) 
degree of a rational matrix function, we define first Blaschke-Potapov products. 
An nxn matrix function B is called a finite Blaschke-Potapov product if it admits 
a factorization 

B = UB 1 B 2 ---B k , (2.1) 
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where 



Bj(z) = p-^J-Pj + (I - Pj 
J 1 - XiZ 



for some Xj in D, orthogonal projections Pj on C n , and a unitary matrix U. The 
degree of the Blaschke-Potapov product (2.1) is defined by 

k 

deg B = rank Pj. 
3=1 

Let now $ be an m x n rational function without poles in C \ clos D. We say 
that $ has {McMillan) degree d > (in other words, deg$ = d) if there exists 
an n x n Blaschke-Potapov product B of degree d such that $5 G if 00 (M mj „), 
but there is no such Blaschke-Potapov product of degree d — 1. If $ is a constant 
function, we put deg$ = 0. It is well known (see e.g., [Pe], Ch. 2, §5) that deg<3> 
is the rank of the Hankel operator Hq> : H 2 (C n ) — > H^C 71 ) for rational functions 
$ with poles in D and 

deg$ = deg$ t (2.2) 

for an arbitrary rational matrix function $. 

Note that it follows from (2.2) that deg$ = d > if and only if there exists a 
Blaschke-Potapov product B of degree d such that B§ e if 00 (M m)Tl ), but there is 
no such Blaschke-Potapov product of degree d — 1. 

If $ is a rational matrix function without poles in clos D, we say that deg $ = 
deg$t>; where the rational matrix function $t, is defined by $\,(z) = $(l/z). 

For a matrix function $ without poles on the unit circle T we put 

deg<3> = degP_$ + degP + <3>. 
It is easy to see that if for r > 0, the function $ r is defined by 

$ r (z) = ®(rz) 

and both $ and $ r have no poles on T, then deg$ = deg$ r . If $ is an arbitrary 
rational matrix function, we define the degree of $ as 

deg$ = deg$ r , 

where r is a positive number such that $ r has no poles on T. 

For a rational matrix function $ and A G C we define the (McMillan) degree of 
$ at X as the degree of the principal part of ^ at A and we use the notation deg A $ 
for the degree of $ at A. 

Clearly, 

deg$ = ^deg A $. 
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For a subset A of C, we use the notation 

deg A $ = ^deg A $. 
AeA 

It is easy to see that for A C D, deg A $ = x > if and only if there exists 
a Blaschke-Potapov product B of degree x such that B<& has no pole in A, but 
there is no such Blaschke-Potapov product of degree x— 1. 

We need the following elementary facts about the degrees of rational functions. 

Lemma 2.1. Let $ be a square rational matrix function. Then 

deg A det $ < deg A $ 

for every AcC. 

Proof. Clearly, it suffices to consider the case when Ac D. Let d = deg A $ 
and let B be a Blaschke-Potapov product of degree d such that <£>£? has no poles in 
A. Then det $5 = det $ det B has no poles in A. It follows immediately from the 
definition of Blaschke-Potapov products that det B is a scalar Blaschke product 
of degree d. Thus deg A det $ < d. ■ 

Suppose now that $ = ( < ^ 11 ) is a 2 x 2 matrix function. We denote by 



_ V?21 V?22 

<3> ad the adjugate matrix function 



^ad def [ </?22 ~V?21 



— V12 Vn 

Lemma 2.2. Lei $ fte a 2 x 2 rational matrix function. Then 

deg $ ad = deg $. 

Proof. For the unitary matrix J = ^ ^ we have $ ad = J$J*. It is easy 

to see from the definition of the degree that the degree does not change when we 
multiply a matrix function by a constant unitary matrix. ■ 

Lemma 2.3. Suppose that <3>i is an m x n rational matrix function and $ 2 is 
an n x k rational matrix function. Then 

deg A ($ 1 $ 2 ) < deg A $ x + deg A $ 2 

for every AcC. 

Proof. Clearly, it suffices to prove this inequality for AcD. Let us show that 
if \l> is a rational function without poles on T and B is a finite Blaschke-Potapov 
product, then 

deg A (^B)>deg A ^-degR (2.3) 
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Let B+ be a Blaschke-Potapov product of degree deg A (^S) such that ^BB+ has 
no pole in A. Then deg(-B-B^) > deg A Together with the obvious equality 
deg(BB^) =degB + degB* this proves (2.3). 

Similarly, if B is a finite Blaschke-Potapov product, then 

deg A (£M0 >deg A *-deg5. (2.4) 

Let now B\ be a Blaschke-Potapov product of degree deg A $1 such that B\§\ 
has no pole in A and let 5 2 be a Blaschke-Potapov product of degree deg A $2 such 
that §B 2 has no pole at A. Then by (2.3) and (2.4), 

= deg A (B 1 $ 1 $ 2 B 2 ) > deg A ( J Bi$i$ 2 ) - degB 2 > deg A ($i$ 2 ) - deg^i - degS 2 

which proves the result. ■ 

Remark. In fact, Lemma 2.1 in the special case of 2 x 2 functions is an easy 
consequence of Lemma 2.2: 

(deg A det $) 2 = deg A ((det $)/ 2x2 ) = deg A ($ ad $) < deg A $ ad deg A $ = (deg A $) 2 . 



2.3. Thematic factorizations and very badly approximable matrix 
functions. For simplicity, we state here results on badly approximable matrix 
functions in the 2x2 case. Note that all results mentioned in this subsection are 
also valid in the case of matrix functions of an arbitrary size; we refer the reader 
to [Pe] (Ch. 14) for the general case. However, in the case of an arbitrary size 
thematic factorizations look more complicated. 

In [PY1] very badly approximable matrix functions were characterized in terms 
of certain special factorizations (thematic factorizations) . In particular, it follows 
from the results of [PY1] that a continuous matrix function ^ on T is very badly 
approximable if and only if it admits a factorization 

*=f ^ ^Af^O \f TJx V 2 \ (25) 



W 2 W\ J V t\U\ J V — 1> 2 V\ 

where < t± < t , u and u x are badly approximable unimodular (i.e., \u 



\ui\ = 1 almost everywhere on T) functions of class QC = H°° + C f]H°° + C; 

v = ^ ^ ^ and w = ^ ^ ^ are inner and co-outer column functions (i.e., 

Vi, f 2 , wi, and w 2 are H°° functions satisfying |t>i| 2 + |f 2 | 2 = \wi\ 2 + |w 2 | 2 = 1 
almost everywhere on T and such that the functions v 1 and t> 2 are coprime and 
the functions Wi and u> 2 are coprime 1 ). Moreover, in this case to — ^o(^) an( i 
t\ = ti(^/) are the superoptimal singular values of We refer the reader to [Pe], 
Ch. 14, §5. Note also that another (more geometric) description of very badly 
approximable matrix functions is given in [PT]. 

1 Recall that H°° functions are called coprime if they have no common nonconstant inner 
divisor. 



It was also shown in [PY1] that in case ^ is nondegenerate (i.e., rank 1 !' = 2 
almost everywhere on T), the Toeplitz operator T z ^ : H 2 (C 2 ) — > H 2 (C 2 ) has dense 
range (see also [Pe], Ch. 14, Theorem 5.4). 

It was also shown in [PY1] that if \l/ is rational, then all functions in the factor- 
ization (2.5) are rational. Moreover, the following result holds. 

Lemma 2.4. The following inequalities hold: 

degP_w < degP_^, (2.6) 

degv < degP_* - 1, (2.7) 

and 

degw < degP_# - 1. (2.8) 

The proof of Lemma 2.4 is implicitly contained in the proof of Lemma 12.2 of 
Ch. 14 of[Pe]. We give here an explicit proof. 

Proof of Lemma 2.4. Multiplying (2.5) on the left by w l , we arrive at the 
following identity: 

= t u v*. (2.9) 
Applying this equality to the vector function v, we obtain 

w t "^v = t u . 

Let A G D. Since v and w l are analytic in D, we have deg A iu t = deg A i> = 0, and 
so by Lemma 2.3, 

deg A M = deg x (w tl &v) < deg A w t + deg A \1/ + deg A v = deg A ^ . (2.10) 

To obtain (2.6), one has to take the sum in (2.10) over D. 
To prove (2.7), we multiply (2.9) by l/u : 

—w^ = t v*, 
uo 

whence 

1 i 

deg v = deg v* = deg A — w t ^ = deg A — ty. 

AgD U ° AeB U ° 

If A is a pole of then by (2.10), deg A ^ > deg A w , and since u is a scalar 
function, we have 

deg A — * < deg A ^ - deg A u . 
u 

If A is a regular point of then 

1 1 

deg A — # = deg A — = deg 1/A u . 

Uq Uo 
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Hence, for arbitrary A G D, we have 

deg A — * < deg A * - deg A w + deg lfJ u . 

Uq 

Summation over A G D yields 

deg v < degP_\l/ — degP_w + degP + w . 

Inequality in (2.7) follows now from (1.1). To prove (2.8), one has to apply (2.7) 
to the matrix function ■ 

To conclude this subsection, we state one more inequality: 

degP_Wi < degP_^ - /i, 

where \x is the multiplicity of the singular value 1 of the Hankel operator H Uo . This 
is a consequence of Theorem 8.1 of Ch. 14 of [Pe]. Thus 

degP_ui < degP_*- 1. (2.11) 

2.4. Singular values of Hu and Hjj* for unitary-valued matrix func- 
tions U. Let U be a unitary-valued n x n matrix function such that the Toeplitz 
operator T v : H 2 (C n ) — > H 2 (C n ) has dense range. Then the operator H v ,Hu* is 
unitarily equivalent to the restriction of H v Hu to KerTJy (see [Pe], Ch. 4, §4). For 
scalar unimodular functions this result was proved in [PK]. It is easy to see that 

KerTu = {f G tf 2 (C") : \\H v f\\ = ||/||}. 

In particular, if Hjj is compact, then dimKerT^ < oo, and so 

Sj(Hu*) — Sj+^Hu), where /j, = dimKerT^. (2.12) 

In other words, \i is the multiplicity of the singular value 1 of Hjj. This means that 
to obtain the singular values of Hjj* , one has to remove from the singular values 
of Hu all singular values equal to 1. 

Suppose now that \I/ is a continuous very badly approximable function of size 
n x n such that tjfif) = 1, 0<j<n — 1. Then \1> is unitary- valued and the 
Toeplitz operator T 2 * has dense range (see §2.3). Since T 2 * is Fredholm (see 
§2.1), Range T 2 ^> = H 2 (C n ). In particular, for any constant vector c there exists 
a function / G H 2 (C n ) such that T z ^f = c. It follows that / G KerT^,, and so 
dimKerT^ > n. Thus it follows from (2.12) that 

rank Hq,* < rank if ,j, — n. 

In other words 

degP+^ < degP_^ - n. (2.13) 
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3. Upper Estimates 



In this section we are going to obtain sharp estimates of the degree of the su- 
peroptimal approximant of a rational 2x2 matrix function $ in terms of the 
degree of $. As we have observed in § 1, this problem reduces to the problem for 
a very badly approximable rational matrix function to estimate the degree of 
P + \& in terms of the degree of P-^. 

Let $ be a very badly approximable rational matrix function with ti ^ 0. 
Without loss of generality we may assume that t (fy) = \\Hy\\ = 1. Then \1/ admits 
a thematic factorization 

* = ( ™* O ( o° ti(*)«i ) ( % ) (») 

Here u and U\ are badly approximable scalar rational functions, v — ^ ^ ^ and 

w — I Wl I are rational inner and co-outer column functions, and 
\ w 2 J 

' ;? ) - d * - ( r ) < 3 - 2 > 

(see §2.3). 

For a rational matrix function G we are going to use the notation 



G*(() = G(l/() 

and 

G* = (G*)\ 

Clearly, G* and G* are rational matrix functions and 



G*(C) = G(0 for (61 

We consider the parametric family {^jjtx) of very badly approximable matrix 
functions: 

*« = (»* Of"„° !)(%)■ (3.3) 



tu! J \ 

As we have discussed in §2.3, \&m is very badly approximable for t < 1. 

In this section we prove that for all but finitely many points t G (0, 1] we have 

degP+%] <degP_%]-2 (3.4) 

and for all t e (0, 1] we have 

degP+% < 2degP_* [t] -3. 
li 



In the case to(^) 7^ and t 1 (\E') = we prove that 

degP+^ < degP_^ - 1. 

Let $ = f [t] be a rational very badly approximable matrix function such that 
ti(*)^0. By (3.1), 

# [t] = u w*v* + tu^tf. (3.5) 

Obviously, 

det (w* O (0 = det ( 5* ) (0 = 1 (3-6) 

for all C G T. Since the matrix functions in (3.6) are rational, it follows that (3.6) 
holds for all £ G C. Thus we have 

dettt [t] (C)=tuo(C)«i(C), CeC. (3.7) 
We start with the inequality 

degP+^[i] < degP_*[i] - 2 (3.8) 

(see (2.13)). 

Let t G (0, 1]. Suppose that both functions UqW^v* and tui^d 1 in (3.5) have a 
pole at A G C. We say that we have a cancellation at A on the right-hand side of 
(3.5) if 

deg x (u w*v* + tu 1 £$ t ) < max{deg A M iy # v*,deg A -Ui^ t }. 

Since both matrix functions u w#v* and ■ui£$ t have rank one, for a given A, we 
can have a cancellation at A for no more than one t. 

Definition. We say that t is a nondisturbing number for the family {$[<]} (or 
simply a nondisturbing number) if for this t there is no cancellation of poles on the 
right-hand side of (3.5). We say that t is a nondisturbing number for a set A C C if 
there is no cancellation of poles for A G A. Otherwise we say that t is a disturbing 
number (disturbing number for A). 

Note that the function w # v* is analytic in C \ closD, and so if u w#v* has a 
pole at A G C\closD, then A is a pole of u . Similarly, the function is analytic 
in D, and so if tuiffl has a pole at A G D, then u\ has a pole at A. Thus there can 
be at most degP+iio disturbing numbers t for the exterior of the unit disk and at 
most degP„Mi disturbing numbers t for the unit disk. Recall that 

degP+Mo < degP_w - 1 < degP_* [t] - 1 for all t G (0, 1] (3.9) 

and 

degP_Mi < degP_%] - 1 for all t G (0, 1] (3.10) 
(see (1.1), (2.6), and (2.11)). 
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We are going to prove that inequality (3.4) holds for all nondisturbing numbers 
t for the unit disk. If 1 is a nondisturbing number for the exterior of the unit 
disk, then (3.4) is an immediate consequence of (3.8). In the case when 1 is a 
disturbing number for the exterior of the unit disk, we prove that 1 must also be a 
disturbing point for the unit disk and the number of cancelled poles (counted with 
multiplicities) outside the unit disk is at least the number of cancelled poles in the 
unit disk. Clearly, this would imply (3.4) for all t nondisturbing for the unit disk. 

We have from (3.5) 

£*% = tun** (3.11) 

and 

* [t] -&# = tu&. (3.12) 

Multiplying (3.11) by the adjugate matrix y&j^ on the right and multiplying (3.12) 
by \I/| d on the left, we obtain 

(det* [t] )t = turf^fi and (det^ [t] )tf # = tu&f^. 

It follows now from (3.7) that 

Uo C = # t *f ] and u o 0# = *f ] Z. (3.13) 

Lemma 3.1. Let ^ be a matrix function of the form (3.1) such thatt\ = ti(^) 7^ 
0. Suppose that uq has a pole of multiplicity d at A, |A| > 1, and deg A ^ — d — I, 
/ > 0. Then u\ has a zero at A of multiplicity at least I, and both v and w have 
poles at A of multiplicity at least I. 

Proof. By Lemma 2.1, 

deg A det ^ < deg A * = d-l. 

By (3.7), U\ has a zero at A of multiplicity at least /. Note that £*(A) 7^ and 
■# # (A) 7^ (this follows from the fact that v and w are co-outer). Thus deg A u £* = 
deg A Uq{># = deg A u = d. On the other hand, by Lemma 2.2, deg A \l/ ad = deg A ^ = 
d — I. Hence, by (3.13), deg A £ > / and deg A # > /, i.e., both v and w have poles 
at A of multiplicity at least /. ■ 

Put 

J= fo -1 

and 



1 
A = wv\ 



We have 

* [t] = u A* - tu x JAJ. (3.14) 
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Lemma 3.2. Let |A| > 1. Suppose that for some t < 1, 

deg A * [t] - deg A $> {1] = I > 0. 

Then for allt < 1, 

deg 1/A * [t] - deg 1/A = I > 0. 



(3.15) 



(3.16) 



Proof. We have already observed that since both matrix functions uqW#v* and 
Ui^i?* have rank one, it follows from (3.5) that if (3.15) holds for some t G (0, 1), 
then it holds for all t G (0, 1). The same is true for (3.16). 

Let d be the multiplicity of the pole of u at A. Since 1 is a disturbing number 
for the point A, we have 



Note that the vector functions w#, v* are analytic and nonzero for |A| > 1. There- 
fore 

deg A * [t] = deg x (u w*v*) = deg A u = d for all t^l 
and at the disturbing number the degree drops by I: deg A = d — I. 

Since for a rational function u unimodular on T, we have w # = 1/u, it follows 
from (3.14) that 

Vt = utA - tut J A* J = — (u\A - tu JA*j) = — JV im J. (3.17) 

ltJ U Ui v ' UqUi l 1 1 

By Lemma 3.1, the function at A cannot have a zero of multiplicity greater 
than d — l. Thus it follows from (3.17) that 1 is a disturbing number for the family 
for the point A and the multiplicity of the pole of at A must also drops 
by I. This means that t — 1 is a disturbing number for the family {^[t]} for the 
point 1/A and the multiplicity of the pole of y&m at 1/A must also drop by /. ■ 

Theorem 3.3. For all nondisturbing numbers t G (0, 1] for the unit disk in- 
equality (3.4) holds. There can be at most degP_Wi numbers t G (0, 1), for which 
inequality (3.4) can be violated. 

Proof. Recall that for all nondisturbing numbers t, deg A ty[t] remains the same. 
Let us first consider the case when 1 is a nondisturbing number for the family 
{^[t]} for the exterior of the unit disk. Then for all nondisturbing numbers t for 
the unit disk we have 



deg x {u w*v*) = deg A (Mi^*). 



degc\ D *[t] = degP+% < degP + * [1] , 
deg D tf M = degP_% > degP^j. 



Therefore inequality (3.8) yields 

degP_%] - degP + %] > degP_*[i] - degP+W^ > 2 
for all nondisturbing numbers t for the unit disk. 
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In the case when 1 is a disturbing number for the family {&[t]} for some A, 
|A| > 1, we use Lemma 3.2. Let A be the set of all points A satisfying the hypotheses 
of Lemma 3.2, and put A # = {1/A: A G A}. Taking the sum in (3.15) and (3.16) 
over A yields 

deg A# * [t] - deg A * [t] = deg A# - deg A 
For all A G C \ (D U A) the number 1 is nondisturbing, whence 

de Sc\(DUA) *[*] ^ de gc\(BUA) *[!]■ 

And finally, if t is a nondisturbing number for the unit disc, then we have 
degP_%] - degP + tf [t] = (deg D \ A# * [t] - deg evDuA) * [t] ) + (deg A# tf [t] - deg A * [t] ) 

> ( d eg D \A # ^[i] - degcvDuA)^!]) + ( de gA # ^[i] - deg A %]) 
= deg -deg P+%] > 2. 

Let us estimate now the amount of disturbing numbers for the unit disc. Since 
the vector functions £ and i?* are analytic in D, the equality 

*[ti] - = (*i " t 2 )u 1 ^ t 

implies that the multiplicity of the pole at a point A G © can drop for some t only 
if A is a pole of u\. Definitely, each pole of u\ can produce at most one disturbing 
number. Thus the total amount of disturbing numbers does not exceed the number 
of poles of U\ in O, what is no more than degP_Ui. ■ 

The following result gives us a sharp estimate for degP+\l/ in the general case. 
The example given in the next section shows that this estimate is the best possible. 

Theorem 3.4. Let be a very badly approximable rational 2x2 matrix function 
such that degP_^ = k > 0. Ifh(^) = 0, then 

degP+^ < k - 1. 

^ 0, then 

degP + ^ < 2k - 3. 

Proof. Consider first the case when ti(^) = 0. Then \I/ can be represented as 

* = (»* o( o o ) ( % ) = "° w * v '' 

where u is a scalar badly approximable function, v and w are inner and co-outer 
column functions, and £ and d are defined by (3.2). By (3.9), 

degP + «o < degP_w < k. 

Since ranku w#v* = 1 and both w* and v* have poles only in D, it follows easily 
from Lemma 2.3 that 

degP+^ = degP + u w*v* < degP + w < k - 1. 
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Suppose now that ti(^) 7^ 0. Consider a nondisturbing number t G (0, 1). By 
Theorem 3.3 

degP+% < degP_* [t] -2. 

By (3.5), 

= * [tlW] = * [t] + (*x(*) - t)^\ (3.18) 

Since t is nondisturbing, we have 

degP + ^ < degP+%] < degP_%] - 2. 

On the other hand, it follows from (3.18) that 

degP„^ > degP_%] - degP-Uiftf* > degP_* [t] - degP_wx 

because both £ and i?* are analytic in D. Thus 

degP+^ - degP_^ < degP_«i - 2. 

The desired inequality follows from (3.10). ■ 

Theorem 3.5. Let $ be a rational 2x2 matrix function such that deg <E> = k > 
and P_$ ^ 0. //ti($) = 0, toen 

deg .4$ < fc - 1. 

//*!($) ^ 0, toen 

deg„4$ < 2k - 3. 

Proof. It suffices to apply Theorem 3.4 for the matrix function \& = $ — „4$. 
Indeed, Theorem 3.4 yields 



deg(P+$ - A<S>) < 



degP_$-l if t 1 ($) = 0, 
2degP_$-3 if ti($) ^ 0. 



Substituting this in the inequality deg A§ < degP+$ + deg(P + $ - A$), we get 
deg A$ < 



degP + $ + degP_$-l = deg$-l if ti($) = 0, 

degP+$ + 2degP_$ - 3 < 2deg$ - 3 if ^ 0. 



Theorem 3.4 shows that for every t G (0, 1), 

degP+%] + 2 - degP_* [t] < degP_«i. 

The next theorem improves this result. It shows that on the left-hand side of 
the above inequality we can take the sum over all t G (0, 1]. 
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Theorem 3.6. Let {^{t}}, t G (0, 1], be the family of very badly approximable 
matrix functions defined by (3.3). Then 

[degP+% + 2-degP_* [t] ] + <degP_«i. (3.19) 

t€(0,l] 

Recall that [x] + = max{a;,0}. 

Proof. By Theorem 3.3, only finite set of disturbing points t±, t2, ■ • ■ , t n e (0, 1), 
n < degP_ui, for the unit disk can produce a nonzero term in (3.19). Denote by 
Aj the subset of D such that tj is a nondisturbing number for D \ Aj and it is a 
disturbing number for every point A in Aj. Take any nondisturbing number t and 
use the same argument as in the proof of Theorem 3.4. The only difference is the 
following: we estimate the difference of the degrees of Wfc] and ty[t] not for the 
whole unit disk, but for each Aj separately. The identity 

%■] = *M + (*j - 

implies that 

deg A . > deg A . * [t] - deg A . u v 
Since tj is nondisturbing for D \ Aj, we have the following identity for this set: 

deg B \ Aj = deg D \ A . * [t] . 
Thus together with the previous inequality, we obtain 

degP_%.] > degP_* [t] - deg A . Ul . 
Since t is nondisturbing, we have as before: 

degP+%,] < degP+%] < degP_^ [t] - 2. 

Therefore 

degP+%.] + 2 - degP_%.] < deg A . Ul . 
Taking the sum over j, we obtain (3.19). ■ 

We will see in Theorem 4.4 that inequality (3.19) is best possible in the sense that 
if x is an integer-valued nonnegative function on (0, 1] that can be nonzero only 
at finitely many points, then there is a family of rational very badly approximable 
functions ^ of the form (3.3) such that 

[degP + * M + 2 - degP_* [t] ] + = Kit) 

and 

x(t) = degP_Ui. 

te(o,i] 
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4. Counterexamples 



In this section for k > 2, we construct an example of a very badly approximable 
2x2 rational matrix function * such that degP_^ = k, but degP + ^ = 2k - 3. 
We are going to look for \1> in the form (3.3). It follows from Theorem 3.3 that t 
must be a disturbing number for the unit disk. 

We also construct an example of the family \I/[ t ] of the form (3.3) such that 
inequality (3.4) can be violated for several values of t e (0, 1]. Actually, it can be 
violated for precisely degP_Wi values of t which shows that Theorem 3.3 cannot be 
improved. Moreover, the same construction shows that the inequality in Theorem 
3.6 cannot be improved. 

The following result gives a necessary condition for the existence of a disturbing 
number for the unit disk that reduces the multiplicity of a pole at a given point of 
D. Its proof is similar to the proof of Lemma 3.1. 

Lemma 4.1. Suppose that a point A G D is a pole of ui of multiplicity d > 
and the multiplicity of the pole of^> at X is d — l, where I > 0. Then uq has a zero 
at A of multiplicity at least I and both v # and w# have a pole at A of multiplicity 
at least I. 

Proof. It is easy to see that 

w t ^ = u v* (4.1) 

and 

^/v = u w*. (4.2) 

Since by Lemma 2.1, deg A det ^ < d — I, it follows from (3.7) that u has a zero 
at A of multiplicity at least /. 

Multiplying (4.1) on the right and (4.2) on the left by the adjugate matrix ^l/ ad 
and using (3.7), we get 



tuiw* = v*^ ad (4.3) 

and 

t Ul v = y &d w*. (4.4) 

By Lemma 2.2, deg A ^ ad = deg A ^. Since v(X) ^ and w t (X) ^ (their compo- 
nents are coprime!), the expressions in these equalities have a pole of multiplicity 
d at the point A. Therefore both v* (and hence i>*) and to* have poles at A, of 
multiplicity at least I. ■ 

We can try to construct a desired example of a function for which all poles 
have multiplicity one. To construct such an example, we have to cancel k — 1 poles 
of tui^ in D. Note that degP_wi < k — 1 (see §2.3). Thus we have to cancel 

18 



all poles of tui^-d 1 in D. By Lemma 4.1, u must have zeros at all poles of U\ in O 
and both v* and w* must have poles at all poles of u 1 in D. 



Theorem 4.2. Let k > 2 be an integer, < t < 1, and a > 1. Suppose that B , 
Bi, and B 2 are Blaschke products such that 

deg B — k — 1, deg B\ = k, deg B 2 = k — 2, 

and the function 1 — ta 2 B\B 2 vanishes at the zeros of B . Let 



'Bo 



T/jen degP_^ = fc, wMe degP + ^ = 2fc — 3. 



Remark. On the right-hand side of formula (4.5) the first and the third matrix 
functions are not as in the definition of thematic factorizations (see (2.5)). How- 
ever, if we multiply the first and the third matrix functions on the right-hand side 
of (4.5) by (1 + a 2 ) 1 / 2 , we obtain matrix functions as in the thematic factorization 
(2.5). Thus the function \1> defined by (4.5) is very badly approximable. 

Proof. It is easy to verify that 



* = 



l=|^a % + t aB 2 



-%-taB 2 B^ + tB 2 

Let us first compute degP.^. By the hypotheses of the theorem, P_\P has 
removable singularities at the zeros of B . Thus P_\l/ has poles only at the zeros 
of Bi. Hence, 



degP_# =deg Bi -i (0) 



B Bi 

a_ 

Si 




We have 



Clearly, 



and so 



a 



S Si B 
Bi " Si 




a 



~ -a 2 W / B i\-fl ~a 2 B 



i 



So a 



rank | "° = 1, 

-a -a z B 



degP_^ = deg^i = fc. 

Let us now compute degP + \l/. Clearly, P + \l/ can have poles only at the poles of 
B 2 and the poles of B . 

Let us first show that there can be no cancellation of poles at the poles of B 2 or 
Bq. The only possibility of cancellation is to cancel poles of B of the lower right 
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2 

entry which can be done if — j^- + tB 2 vanishes at poles of B . Let ( be a zero of 
Bq and suppose that 

It follows that 

a 2 B 1 ({)B 2 ({)=t. 

Since t < 1, this contradicts the assumption that ta 2 Bi(()B 2 (() = 1. 

First, we compute the contribution of the poles of B . Clearly, only the lower 
right entry of P+\& has poles at poles of B . Obviously, 

de &B-\oc) * = * - 1. 
Let us compute the contribution of the poles of B 2 . We have 



ta 2 B 2 \ / -ta 2 ta 



Clearly, 



-ta A 



ta 



rank | B ° } = 1, 

-ta tB 



and so the contribution of the poles of B 2 is degB 2 = k — 2. Thus 

degP+^ = k-l+k-2 = 2k-3. I 

Corollary 4.3. Let ^ be the function defined in Theorem 4.2 and let $ = P_\l/. 
Then deg $ = A; and deg^l<l ) = 2 A; — 3. 

Finally, we can slightly modify the construction in Theorem 4.2 to get a family 
^/[t] of the form (3.3) such that inequality (3.4) is violated for several different 
values of t e (0, 1). 

The following result shows that roughly speaking, inequality (3.19) is the only 
constraint for the numbers [degP+^t] + 2 — degP_\I/[ t ]] + . 

Theorem 4.4. Let k, a, B , Bi, and B 2 be as in Theorem 4.2. Suppose that 
ii, • • • ,td are distinct points in (0,1), Ai,--- , A<j is a partition of the zero set 
of Bq, and Xj is the number of zeros of Bq in Aj (counted with multiplicities). 
Assume that 

1 — tja 2 BiB 2 vanishes at the points of Aj, 1 < j < d. (4.6) 
If the very badly approximable function ^[ t ] is defined by 

--(*0(ti)(iO- 
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then inequality (3.4) is violated precisely for t — t±,- • • ,td and 

degP+%.] = degP_^ te] - 2 + x,, 

The proof of Theorem 4.4 is exactly the same as the proof of Theorem 4.2. 
In particular, if each Aj is a singleton, inequality (3.4) is violated for precisely 
degP.Mx values of t. 

Remark. Let us fix the zeros of B . Condition (4.6) means that the Blaschke 
product BiB 2 interpolates at the points of Aj the value (a 2 tj)~ 1 . It is well known 
(see e.g., [Pe], Ch. 7, §1) that if a is sufficiently large, it is possible to find a 
Blaschke product of a given degree greater than or equal to k that solves this 
interpolation problem. Thus if a is sufficiently large, it is possible to find B\ and 
B 2 satisfying the hypotheses of the theorem. 
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